Processing Information Graphics in Multimodal Documents

Authors

  • Richard Burns
  • Sandra Carberry
  • Stephanie Elzer Schwartz
Abstract

Information graphics, such as bar charts, grouped bar charts, and line graphs, are an important component of multimodal documents and cannot be ignored. When such graphics appear in popular media, such as magazines and newspapers, they generally have an intended message. We argue that this message represents a brief summary of the graphic’s high-level content, and thus can serve as the basis for more robust information extraction from multimodal documents. The paper describes our methodology for automatically recognizing the intended message of an information graphic, with a focus on grouped bar charts.


Similar resources

Directional Stroke Width Transform to Separate Text and Graphics in City Maps

City maps are among the most complex real-world documents. In such maps, text labels overlap with graphics and appear in a variety of fonts, styles, and orientations. Text and graphic colours are usually not predefined because they vary among map publishers. In most city maps, text and graphic lines form a single connected component. Moreover, the common regions of text and graphic lin...

Tracing Integration of Text and Pictures in Newspaper Reading

Newspapers and net papers are complex multimodal documents consisting of texts, pictures and graphics. Although we encounter such documents in our everyday life, there is still little empirical evidence about how these formats are processed. The question is how readers interact with these formats, combine information from all of the available sources and create coherence. In a naturalistic news...

Toward Extractive Summarization of Multimodal Documents

Summarization research has focused on text, and relatively little attention has been given to the summarization of multimodal documents. If extractive summarization techniques are to be used on multimodal documents containing information graphics (bar charts, line graphs, etc.), then a strategy must be devised both for extracting the high-level content of the information graphics and for identi...

Semantic Modeling of Multimodal Documents for Abstractive Summarization

We describe a method for semantic modeling of multimodal documents and discuss how this can be used to generate an abstractive summary. Information extracted from the text by a semantic parser and from the graphics by a graph understanding system is combined into a single knowledge base. By operating at the semantic (rather than the surface) level, we are able to integrate information collected...

Semi-automated annotation of page-based documents within the Genre and Multimodality framework

This paper describes ongoing work on a tool developed for annotating document images for their multimodal features and compiling this information into a corpus. The tool leverages open source computer vision and natural language processing libraries to describe the content and structure of multimodal documents and to generate multiple layers of XML annotation. The paper introduces the annotatio...

Publication date: 2008